Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 2939478 |
| Missing cells | 58016800 |
| Missing cells (%) | 49.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 897.1 MiB |
| Average record size in memory | 320.0 B |
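The summary figures above can be cross-checked against each other. The sketch below assumes that the missing-cell percentage is taken over the full grid of variables × observations and that 1 MiB means 2^20 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics table.
n_variables = 40
n_observations = 2_939_478
missing_cells = 58_016_800
total_size_mib = 897.1

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both derived values agree with the reported 49.3% and 320.0 B, which corresponds to 8 bytes per cell across 40 columns.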
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 16 |
| Unsupported | 8 |
id_mutation has a high cardinality: 1274243 distinct values | High cardinality |
date_mutation has a high cardinality: 366 distinct values | High cardinality |
adresse_nom_voie has a high cardinality: 445200 distinct values | High cardinality |
adresse_code_voie has a high cardinality: 14726 distinct values | High cardinality |
nom_commune has a high cardinality: 30417 distinct values | High cardinality |
ancien_nom_commune has a high cardinality: 1772 distinct values | High cardinality |
id_parcelle has a high cardinality: 1820762 distinct values | High cardinality |
ancien_id_parcelle has a high cardinality: 25690 distinct values | High cardinality |
code_nature_culture_speciale has a high cardinality: 124 distinct values | High cardinality |
nature_culture_speciale has a high cardinality: 124 distinct values | High cardinality |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot4_surface_carrez and 3 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 4 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 3 other fields | High correlation |
lot4_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
surface_terrain is highly correlated with lot1_surface_carrez | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot5_surface_carrez | High correlation |
lot2_surface_carrez is highly correlated with lot4_surface_carrez | High correlation |
lot3_surface_carrez is highly correlated with surface_reelle_bati | High correlation |
lot4_surface_carrez is highly correlated with lot2_surface_carrez | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot3_surface_carrez | High correlation |
nombre_pieces_principales is highly correlated with code_type_local | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with surface_reelle_bati and 1 other fields | High correlation |
lot2_surface_carrez is highly correlated with surface_reelle_bati and 1 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot4_surface_carrez and 1 other fields | High correlation |
lot4_surface_carrez is highly correlated with lot3_surface_carrez and 1 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot4_surface_carrez | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_nature_culture is highly correlated with nature_culture | High correlation |
nature_culture is highly correlated with code_nature_culture | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
numero_disposition is highly correlated with nature_mutation | High correlation |
nature_mutation is highly correlated with numero_disposition | High correlation |
valeur_fonciere is highly correlated with lot3_surface_carrez | High correlation |
adresse_numero is highly correlated with adresse_suffixe and 1 other fields | High correlation |
adresse_suffixe is highly correlated with adresse_numero | High correlation |
code_postal is highly correlated with ancien_code_commune and 1 other fields | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot3_surface_carrez and 2 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 1 other fields | High correlation |
lot3_surface_carrez is highly correlated with valeur_fonciere and 4 other fields | High correlation |
lot4_surface_carrez is highly correlated with adresse_numero and 4 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 1 other fields | High correlation |
nombre_lots is highly correlated with lot3_surface_carrez | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
code_nature_culture is highly correlated with nature_culture | High correlation |
nature_culture is highly correlated with code_nature_culture | High correlation |
longitude is highly correlated with code_postal and 1 other fields | High correlation |
latitude is highly correlated with longitude | High correlation |
valeur_fonciere has 36638 (1.2%) missing values | Missing |
adresse_numero has 1257084 (42.8%) missing values | Missing |
adresse_suffixe has 2816323 (95.8%) missing values | Missing |
ancien_code_commune has 2847076 (96.9%) missing values | Missing |
ancien_nom_commune has 2847076 (96.9%) missing values | Missing |
ancien_id_parcelle has 2907846 (98.9%) missing values | Missing |
numero_volume has 2927451 (99.6%) missing values | Missing |
lot1_numero has 2050131 (69.7%) missing values | Missing |
lot1_surface_carrez has 2700907 (91.9%) missing values | Missing |
lot2_numero has 2747922 (93.5%) missing values | Missing |
lot2_surface_carrez has 2879425 (98.0%) missing values | Missing |
lot3_numero has 2907565 (98.9%) missing values | Missing |
lot3_surface_carrez has 2933591 (99.8%) missing values | Missing |
lot4_numero has 2928634 (99.6%) missing values | Missing |
lot4_surface_carrez has 2938019 (> 99.9%) missing values | Missing |
lot5_numero has 2934306 (99.8%) missing values | Missing |
lot5_surface_carrez has 2938834 (> 99.9%) missing values | Missing |
code_type_local has 1350027 (45.9%) missing values | Missing |
type_local has 1350027 (45.9%) missing values | Missing |
surface_reelle_bati has 1763264 (60.0%) missing values | Missing |
nombre_pieces_principales has 1352568 (46.0%) missing values | Missing |
code_nature_culture has 901816 (30.7%) missing values | Missing |
nature_culture has 901816 (30.7%) missing values | Missing |
code_nature_culture_speciale has 2800349 (95.3%) missing values | Missing |
nature_culture_speciale has 2800349 (95.3%) missing values | Missing |
surface_terrain has 901874 (30.7%) missing values | Missing |
longitude has 112870 (3.8%) missing values | Missing |
latitude has 112870 (3.8%) missing values | Missing |
numero_disposition is highly skewed (γ1 = 35.02589871) | Skewed |
lot1_surface_carrez is highly skewed (γ1 = 27.63008812) | Skewed |
lot2_surface_carrez is highly skewed (γ1 = 33.39231458) | Skewed |
lot3_surface_carrez is highly skewed (γ1 = 38.11228458) | Skewed |
nombre_lots is highly skewed (γ1 = 82.80203865) | Skewed |
surface_reelle_bati is highly skewed (γ1 = 101.1116364) | Skewed |
surface_terrain is highly skewed (γ1 = 121.0253572) | Skewed |
code_commune is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
code_departement is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
numero_volume is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot1_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot2_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot3_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot4_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot5_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
nombre_lots has 2050131 (69.7%) zeros | Zeros |
nombre_pieces_principales has 504465 (17.2%) zeros | Zeros |
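The Skewed alerts above report γ1, the skewness coefficient. As a minimal sketch, the function below computes the population (moment-based) skewness; the profiling tool may use a bias-adjusted estimator instead, so the exact formula is an assumption. The two samples are hypothetical and only illustrate how a long right tail yields a large positive γ1.

```python
from statistics import fmean

def skewness(values):
    """Population skewness: third central moment divided by variance**1.5."""
    mean = fmean(values)
    m2 = fmean([(v - mean) ** 2 for v in values])
    m3 = fmean([(v - mean) ** 3 for v in values])
    return m3 / m2 ** 1.5

# Hypothetical samples, not taken from the dataset.
symmetric = [1, 2, 3, 4, 5]
right_tailed = [1] * 9 + [100]

print(skewness(symmetric))      # no tail, so the third moment cancels out
print(skewness(right_tailed))   # one large value pulls the tail to the right
```

Columns such as surface_terrain, where most values are small and a few are very large, behave like the second sample.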
Reproduction
| Analysis started | 2021-10-05 22:34:38.415698 |
|---|---|
| Analysis finished | 2021-10-05 22:48:21.636508 |
| Duration | 13 minutes and 43.22 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
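The reported duration follows directly from the two timestamps. A small check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2021-10-05 22:34:38.415698")
finished = datetime.fromisoformat("2021-10-05 22:48:21.636508")

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```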
id_mutation
Categorical
| Distinct | 1274243 |
|---|---|
| Distinct (%) | 43.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| 2016-193380 | 5545 |
|---|---|
| 2016-1154770 | 3756 |
| 2016-1210409 | 2608 |
| 2016-891506 | 2206 |
| 2016-891518 | 1698 |
| Other values (1274238) |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.10557147 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 622975 ? |
|---|---|
| Unique (%) | 21.2% |
Sample
| 1st row | 2016-1 |
|---|---|
| 2nd row | 2016-2 |
| 3rd row | 2016-2 |
| 4th row | 2016-2 |
| 5th row | 2016-2 |
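The report distinguishes Distinct (the number of different values) from Unique (values that occur exactly once). A minimal sketch using the five sample rows above, followed by a check of the reported percentages against the observation count; the variable names are illustrative.

```python
from collections import Counter

# The five sample rows shown above.
sample = ["2016-1", "2016-2", "2016-2", "2016-2", "2016-2"]
counts = Counter(sample)

n_distinct = len(counts)                                # different values
n_unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
print(n_distinct, n_unique)

# Percentages for the full column, using the counts reported above.
n_observations = 2_939_478
distinct_pct = 100 * 1_274_243 / n_observations
unique_pct = 100 * 622_975 / n_observations
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```

A mutation identifier that repeats across rows indicates one transaction recorded over several rows, which is why fewer than half of the rows carry a distinct id_mutation.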
Common Values
| Value | Count | Frequency (%) |
| 2016-193380 | 5545 | 0.2% |
| 2016-1154770 | 3756 | 0.1% |
| 2016-1210409 | 2608 | 0.1% |
| 2016-891506 | 2206 | 0.1% |
| 2016-891518 | 1698 | 0.1% |
| 2016-1172857 | 1570 | 0.1% |
| 2016-962449 | 1397 | < 0.1% |
| 2016-1215741 | 1354 | < 0.1% |
| 2016-760057 | 1253 | < 0.1% |
| 2016-1176057 | 1237 | < 0.1% |
| Other values (1274233) | 2916854 | 99.2% |
date_mutation
Categorical
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| 2016-12-30 | 31580 |
|---|---|
| 2016-12-29 | 30141 |
| 2016-06-30 | 27563 |
| 2016-12-22 | 25443 |
| 2016-12-16 | 23455 |
| Other values (361) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2016-01-08 |
|---|---|
| 2nd row | 2016-01-11 |
| 3rd row | 2016-01-11 |
| 4th row | 2016-01-11 |
| 5th row | 2016-01-11 |
Common Values
| Value | Count | Frequency (%) |
| 2016-12-30 | 31580 | 1.1% |
| 2016-12-29 | 30141 | 1.0% |
| 2016-06-30 | 27563 | 0.9% |
| 2016-12-22 | 25443 | 0.9% |
| 2016-12-16 | 23455 | 0.8% |
| 2016-09-30 | 23203 | 0.8% |
| 2016-12-21 | 22608 | 0.8% |
| 2016-07-29 | 22504 | 0.8% |
| 2016-12-15 | 20443 | 0.7% |
| 2016-12-20 | 20252 | 0.7% |
| Other values (356) | 2692286 | 91.6% |
numero_disposition
Real number (ℝ≥0)
| Distinct | 964 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.111543274 |
| Minimum | 1 |
|---|---|
| Maximum | 1271 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 1271 |
| Range | 1270 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 26.04074981 |
|---|---|
| Coefficient of variation (CV) | 12.33256743 |
| Kurtosis | 1375.699129 |
| Mean | 2.111543274 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 35.02589871 |
| Sum | 6206835 |
| Variance | 678.1206508 |
| Monotonicity | Not monotonic |
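Several of the descriptive statistics above are derived from one another. The sketch below recomputes the coefficient of variation, the variance, and the sum from the reported mean and standard deviation, assuming CV = standard deviation / mean and that the column has no missing values (as reported above).

```python
import math

# Values copied from the tables above.
mean = 2.111543274
std = 26.04074981
n_observations = 2_939_478   # no missing values in this column

cv = std / mean              # coefficient of variation
variance = std ** 2
total = mean * n_observations

print(f"CV = {cv:.6f}, variance = {variance:.4f}, sum = {total:.0f}")
```

A CV above 12 together with a median of 1 shows that nearly all rows have a value of 1, with a small number of very large values.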
| Value | Count | Frequency (%) |
| 1 | 2687305 | 91.4% |
| 2 | 189624 | 6.5% |
| 3 | 35645 | 1.2% |
| 4 | 6592 | 0.2% |
| 18 | 4305 | 0.1% |
| 5 | 2082 | 0.1% |
| 6 | 1028 | < 0.1% |
| 45 | 557 | < 0.1% |
| 7 | 514 | < 0.1% |
| 8 | 444 | < 0.1% |
| Other values (954) | 11382 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 2687305 | 91.4% |
| 2 | 189624 | 6.5% |
| 3 | 35645 | 1.2% |
| 4 | 6592 | 0.2% |
| 5 | 2082 | 0.1% |
| 6 | 1028 | < 0.1% |
| 7 | 514 | < 0.1% |
| 8 | 444 | < 0.1% |
| 9 | 444 | < 0.1% |
| 10 | 358 | < 0.1% |
| Value | Count | Frequency (%) |
| 1271 | 31 | < 0.1% |
| 1270 | 31 | < 0.1% |
| 1264 | 25 | < 0.1% |
| 1262 | 25 | < 0.1% |
| 1259 | 22 | < 0.1% |
| 1256 | 22 | < 0.1% |
| 1248 | 19 | < 0.1% |
| 1244 | 19 | < 0.1% |
| 1241 | 17 | < 0.1% |
| 1237 | 16 | < 0.1% |
nature_mutation
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| Vente | 2644109 |
|---|---|
| Vente en l'état futur d'achèvement | 216603 |
| Echange | 47199 |
| Adjudication | 14722 |
| Expropriation | 8612 |
Length
| Max length | 34 |
|---|---|
| Median length | 5 |
| Mean length | 7.272363665 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Vente |
|---|---|
| 2nd row | Vente |
| 3rd row | Vente |
| 4th row | Vente |
| 5th row | Vente |
Common Values
| Value | Count | Frequency (%) |
| Vente | 2644109 | 90.0% |
| Vente en l'état futur d'achèvement | 216603 | 7.4% |
| Echange | 47199 | 1.6% |
| Adjudication | 14722 | 0.5% |
| Expropriation | 8612 | 0.3% |
| Vente terrain à bâtir | 8233 | 0.3% |
Words
| Value | Count | Frequency (%) |
| vente | 2868945 | 74.9% |
| d'achèvement | 216603 | 5.7% |
| futur | 216603 | 5.7% |
| l'état | 216603 | 5.7% |
| en | 216603 | 5.7% |
| echange | 47199 | 1.2% |
| adjudication | 14722 | 0.4% |
| expropriation | 8612 | 0.2% |
| bâtir | 8233 | 0.2% |
| à | 8233 | 0.2% |
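The lowercase table above appears to count whitespace-separated words rather than whole values. The sketch below reproduces its counts from the Common Values table, assuming each value is lowercased and split on whitespace; that tokenisation rule is an assumption, though it matches the reported numbers.

```python
from collections import Counter

# Value counts copied from the Common Values table.
value_counts = {
    "Vente": 2644109,
    "Vente en l'état futur d'achèvement": 216603,
    "Echange": 47199,
    "Adjudication": 14722,
    "Expropriation": 8612,
    "Vente terrain à bâtir": 8233,
}

# Assumption: words are obtained by lowercasing and splitting on whitespace.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / total_words:.1f}%")
```

The word "vente" occurs in three of the six categories, which is why its count exceeds that of the category Vente alone.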
valeur_fonciere
Real number (ℝ≥0)
| Distinct | 119790 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 36638 |
| Missing (%) | 1.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1023719.823 |
| Minimum | 0.06 |
|---|---|
| Maximum | 396000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 2000 |
| Q1 | 50500 |
| median | 135000 |
| Q3 | 242210.9 |
| 95-th percentile | 1200000 |
| Maximum | 396000000 |
| Range | 395999999.9 |
| Interquartile range (IQR) | 191710.9 |
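The range and interquartile range follow from the quantiles listed above. A short check, assuming the usual definitions range = maximum − minimum and IQR = Q3 − Q1:

```python
# Quantiles copied from the table above.
minimum, maximum = 0.06, 396_000_000
q1, q3 = 50_500, 242_210.9

value_range = maximum - minimum
iqr = q3 - q1

print(f"range = {value_range:.1f}, IQR = {iqr:.1f}")
```

The range is roughly 2,000 times the IQR, so a handful of very large transactions dominate the mean while the median stays at 135000.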
Descriptive statistics
| Standard deviation | 8817792.946 |
|---|---|
| Coefficient of variation (CV) | 8.613482664 |
| Kurtosis | 360.6564053 |
| Mean | 1023719.823 |
| Median Absolute Deviation (MAD) | 92000 |
| Skewness | 17.37404568 |
| Sum | 2.97169485 × 10^12 |
| Variance | 7.775347243 × 10^13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 27965 | 1.0% |
| 150000 | 26475 | 0.9% |
| 1 | 25541 | 0.9% |
| 120000 | 24934 | 0.8% |
| 80000 | 23571 | 0.8% |
| 50000 | 22756 | 0.8% |
| 110000 | 21872 | 0.7% |
| 60000 | 21730 | 0.7% |
| 130000 | 21655 | 0.7% |
| 90000 | 21210 | 0.7% |
| Other values (119780) | 2665131 | 90.7% |
| (Missing) | 36638 | 1.2% |
| Value | Count | Frequency (%) |
| 0.06 | 5 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 4 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.15 | 897 | < 0.1% |
| 0.16 | 11 | < 0.1% |
| 0.18 | 294 | < 0.1% |
| 0.19 | 8 | < 0.1% |
| 0.2 | 18 | < 0.1% |
| 0.23 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 396000000 | 8 | < 0.1% |
| 394377152 | 19 | < 0.1% |
| 378000000 | 1 | < 0.1% |
| 372100416 | 3 | < 0.1% |
| 333760000 | 34 | < 0.1% |
| 330000000 | 3 | < 0.1% |
| 285600000 | 1 | < 0.1% |
| 267611000 | 24 | < 0.1% |
| 264831008 | 1 | < 0.1% |
| 251480768 | 17 | < 0.1% |
adresse_numero
Real number (ℝ≥0)
| Distinct | 6938 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1257084 |
| Missing (%) | 42.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 813.2028389 |
| Minimum | 1 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 25 |
| Q3 | 99 |
| 95-th percentile | 5800 |
| Maximum | 9999 |
| Range | 9998 |
| Interquartile range (IQR) | 91 |
Descriptive statistics
| Standard deviation | 2166.810145 |
|---|---|
| Coefficient of variation (CV) | 2.664538343 |
| Kurtosis | 6.704854514 |
| Mean | 813.2028389 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 2.808791473 |
| Sum | 1368127577 |
| Variance | 4695066.204 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 74899 | 2.5% |
| 2 | 66997 | 2.3% |
| 3 | 53312 | 1.8% |
| 4 | 51751 | 1.8% |
| 5 | 48119 | 1.6% |
| 6 | 47861 | 1.6% |
| 7 | 42437 | 1.4% |
| 8 | 40663 | 1.4% |
| 10 | 37255 | 1.3% |
| 9 | 36216 | 1.2% |
| Other values (6928) | 1182884 | 40.2% |
| (Missing) | 1257084 | 42.8% |
| Value | Count | Frequency (%) |
| 1 | 74899 | 2.5% |
| 2 | 66997 | 2.3% |
| 3 | 53312 | 1.8% |
| 4 | 51751 | 1.8% |
| 5 | 48119 | 1.6% |
| 6 | 47861 | 1.6% |
| 7 | 42437 | 1.4% |
| 8 | 40663 | 1.4% |
| 9 | 36216 | 1.2% |
| 10 | 37255 | 1.3% |
| Value | Count | Frequency (%) |
| 9999 | 317 | < 0.1% |
| 9998 | 41 | < 0.1% |
| 9997 | 6 | < 0.1% |
| 9996 | 9 | < 0.1% |
| 9995 | 10 | < 0.1% |
| 9994 | 12 | < 0.1% |
| 9993 | 5 | < 0.1% |
| 9992 | 3 | < 0.1% |
| 9991 | 31 | < 0.1% |
| 9990 | 6 | < 0.1% |
adresse_suffixe
Categorical
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2816323 |
| Missing (%) | 95.8% |
| Memory size | 22.4 MiB |
| B | 69866 |
|---|---|
| A | 20277 |
| T | 10634 |
| F | 10584 |
| C | 4046 |
| Other values (35) | 7748 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | B |
| 3rd row | D |
| 4th row | D |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
| B | 69866 | 2.4% |
| A | 20277 | 0.7% |
| T | 10634 | 0.4% |
| F | 10584 | 0.4% |
| C | 4046 | 0.1% |
| D | 1962 | 0.1% |
| E | 1072 | < 0.1% |
| Q | 839 | < 0.1% |
| P | 712 | < 0.1% |
| G | 500 | < 0.1% |
| Other values (30) | 2663 | 0.1% |
| (Missing) | 2816323 | 95.8% |
Words
| Value | Count | Frequency (%) |
| b | 69866 | 56.7% |
| a | 20277 | 16.5% |
| t | 10634 | 8.6% |
| f | 10584 | 8.6% |
| c | 4046 | 3.3% |
| d | 1962 | 1.6% |
| e | 1072 | 0.9% |
| q | 839 | 0.7% |
| p | 712 | 0.6% |
| g | 500 | 0.4% |
| Other values (27) | 2663 | 2.2% |
adresse_nom_voie
Categorical
| Distinct | 445200 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 23338 |
| Missing (%) | 0.8% |
| Memory size | 22.4 MiB |
| LE VILLAGE | 28548 |
|---|---|
| LE BOURG | 24988 |
| GR GRANDE RUE | 5370 |
| RUE DE LA REPUBLIQUE | 5104 |
| RUE JEAN JAURES | 5043 |
| Other values (445195) |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 14.61219969 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 154082 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | RUE TONY REVILLON |
|---|---|
| 2nd row | LES BROTTEAUX |
| 3rd row | LES BROTTEAUX |
| 4th row | LES BROTTEAUX |
| 5th row | LES BROTTEAUX |
Common Values
| Value | Count | Frequency (%) |
| LE VILLAGE | 28548 | 1.0% |
| LE BOURG | 24988 | 0.9% |
| GR GRANDE RUE | 5370 | 0.2% |
| RUE DE LA REPUBLIQUE | 5104 | 0.2% |
| RUE JEAN JAURES | 5043 | 0.2% |
| RUE PASTEUR | 4931 | 0.2% |
| AV JEAN JAURES | 4435 | 0.2% |
| AV DE LA REPUBLIQUE | 4152 | 0.1% |
| RUE VICTOR HUGO | 4147 | 0.1% |
| RUE DE PARIS | 3994 | 0.1% |
| Other values (445190) | 2825428 | 96.1% |
| (Missing) | 23338 | 0.8% |
Words
| Value | Count | Frequency (%) |
| rue | 946585 | 11.4% |
| de | 626364 | 7.5% |
| la | 433378 | 5.2% |
| du | 289150 | 3.5% |
| le | 260414 | 3.1% |
| des | 241579 | 2.9% |
| av | 220383 | 2.6% |
| les | 208878 | 2.5% |
| che | 90125 | 1.1% |
| rte | 83050 | 1.0% |
| Other values (189414) | 4924560 | 59.2% |
adresse_code_voie
Categorical
| Distinct | 14726 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 23310 |
| Missing (%) | 0.8% |
| Memory size | 22.4 MiB |
| B003 | 14637 |
|---|---|
| B006 | 14144 |
| B005 | 14034 |
| B002 | 13888 |
| B012 | 13788 |
| Other values (14721) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1170 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0560 |
|---|---|
| 2nd row | B011 |
| 3rd row | B011 |
| 4th row | B011 |
| 5th row | B011 |
Common Values
| Value | Count | Frequency (%) |
| B003 | 14637 | 0.5% |
| B006 | 14144 | 0.5% |
| B005 | 14034 | 0.5% |
| B002 | 13888 | 0.5% |
| B012 | 13788 | 0.5% |
| B011 | 13615 | 0.5% |
| B001 | 13527 | 0.5% |
| B009 | 13416 | 0.5% |
| B008 | 13406 | 0.5% |
| B015 | 13288 | 0.5% |
| Other values (14716) | 2778425 | 94.5% |
| (Missing) | 23310 | 0.8% |
Words
| Value | Count | Frequency (%) |
| b003 | 14637 | 0.5% |
| b006 | 14144 | 0.5% |
| b005 | 14034 | 0.5% |
| b002 | 13888 | 0.5% |
| b012 | 13788 | 0.5% |
| b011 | 13615 | 0.5% |
| b001 | 13527 | 0.5% |
| b009 | 13416 | 0.5% |
| b008 | 13406 | 0.5% |
| b015 | 13288 | 0.5% |
| Other values (14716) | 2778425 | 95.3% |
code_postal
Real number (ℝ≥0)
| Distinct | 5863 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 23494 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50558.51963 |
| Minimum | 1000 |
|---|---|
| Maximum | 97490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 6670 |
| Q1 | 29250 |
| median | 49370 |
| Q3 | 75014 |
| 95-th percentile | 93160 |
| Maximum | 97490 |
| Range | 96490 |
| Interquartile range (IQR) | 45764 |
Descriptive statistics
| Standard deviation | 27494.99945 |
|---|---|
| Coefficient of variation (CV) | 0.5438252476 |
| Kurtosis | -1.206952752 |
| Mean | 50558.51963 |
| Median Absolute Deviation (MAD) | 24040 |
| Skewness | -0.009250483748 |
| Sum | 1.474278343 × 10^11 |
| Variance | 755974994.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 35000 | 6801 | 0.2% |
| 18000 | 6492 | 0.2% |
| 21000 | 6229 | 0.2% |
| 69100 | 6138 | 0.2% |
| 92110 | 6041 | 0.2% |
| 31200 | 5894 | 0.2% |
| 92150 | 5793 | 0.2% |
| 75016 | 5789 | 0.2% |
| 75015 | 5757 | 0.2% |
| 54000 | 5595 | 0.2% |
| Other values (5853) | 2855455 | 97.1% |
| (Missing) | 23494 | 0.8% |
| Value | Count | Frequency (%) |
| 1000 | 1625 | 0.1% |
| 1090 | 346 | < 0.1% |
| 1100 | 883 | < 0.1% |
| 1110 | 375 | < 0.1% |
| 1120 | 709 | < 0.1% |
| 1130 | 306 | < 0.1% |
| 1140 | 439 | < 0.1% |
| 1150 | 907 | < 0.1% |
| 1160 | 696 | < 0.1% |
| 1170 | 1236 | < 0.1% |
| Value | Count | Frequency (%) |
| 97490 | 1543 | 0.1% |
| 97480 | 648 | < 0.1% |
| 97470 | 328 | < 0.1% |
| 97460 | 462 | < 0.1% |
| 97450 | 217 | < 0.1% |
| 97442 | 55 | < 0.1% |
| 97441 | 220 | < 0.1% |
| 97440 | 354 | < 0.1% |
| 97439 | 53 | < 0.1% |
| 97438 | 426 | < 0.1% |
nom_commune
Categorical
| Distinct | 30417 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| Toulouse | 23731 |
|---|---|
| Nantes | 15689 |
| Bordeaux | 14566 |
| Nice | 14181 |
| Montpellier | 13813 |
| Other values (30412) |
Length
| Max length | 45 |
|---|---|
| Median length | 10 |
| Mean length | 11.87023887 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 378 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Saint-Laurent-sur-Saône |
|---|---|
| 2nd row | Varambon |
| 3rd row | Varambon |
| 4th row | Varambon |
| 5th row | Varambon |
Common Values
| Value | Count | Frequency (%) |
| Toulouse | 23731 | 0.8% |
| Nantes | 15689 | 0.5% |
| Bordeaux | 14566 | 0.5% |
| Nice | 14181 | 0.5% |
| Montpellier | 13813 | 0.5% |
| Rennes | 10382 | 0.4% |
| Lille | 10349 | 0.4% |
| Nîmes | 6844 | 0.2% |
| Angers | 6771 | 0.2% |
| Bourges | 6492 | 0.2% |
| Other values (30407) | 2816660 | 95.8% |
Words
| Value | Count | Frequency (%) |
| arrondissement | 106553 | 3.1% |
| la | 90796 | 2.7% |
| le | 78397 | 2.3% |
| paris | 58411 | 1.7% |
| les | 30835 | 0.9% |
| marseille | 28187 | 0.8% |
| toulouse | 23731 | 0.7% |
| lyon | 19955 | 0.6% |
| nantes | 15689 | 0.5% |
| bordeaux | 14566 | 0.4% |
| Other values (30324) | 2940145 | 86.3% |
ancien_code_commune
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1783 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 2847076 |
| Missing (%) | 96.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52147.84335 |
| Minimum | 1025 |
|---|---|
| Maximum | 95308 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1025 |
|---|---|
| 5-th percentile | 14126 |
| Q1 | 28103 |
| median | 50097 |
| Q3 | 74010 |
| 95-th percentile | 90068 |
| Maximum | 95308 |
| Range | 94283 |
| Interquartile range (IQR) | 45907 |
Descriptive statistics
| Standard deviation | 25880.59711 |
|---|---|
| Coefficient of variation (CV) | 0.4962927601 |
| Kurtosis | -1.169859044 |
| Mean | 52147.84335 |
| Median Absolute Deviation (MAD) | 23913 |
| Skewness | -0.1370263323 |
| Sum | 4818565021 |
| Variance | 669805306.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 74010 | 6204 | 0.2% |
| 91228 | 1554 | 0.1% |
| 85194 | 1483 | 0.1% |
| 93070 | 1362 | < 0.1% |
| 78551 | 1222 | < 0.1% |
| 95306 | 1023 | < 0.1% |
| 73257 | 984 | < 0.1% |
| 85166 | 934 | < 0.1% |
| 85060 | 795 | < 0.1% |
| 74282 | 668 | < 0.1% |
| Other values (1773) | 76173 | 2.6% |
| (Missing) | 2847076 | 96.9% |
| Value | Count | Frequency (%) |
| 1025 | 229 | < 0.1% |
| 1033 | 518 | < 0.1% |
| 1036 | 158 | < 0.1% |
| 1059 | 4 | < 0.1% |
| 1091 | 215 | < 0.1% |
| 1095 | 77 | < 0.1% |
| 1097 | 22 | < 0.1% |
| 1098 | 32 | < 0.1% |
| 1119 | 12 | < 0.1% |
| 1122 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 95308 | 12 | < 0.1% |
| 95306 | 1023 | < 0.1% |
| 95259 | 3 | < 0.1% |
| 95040 | 8 | < 0.1% |
| 93070 | 1362 | < 0.1% |
| 91390 | 131 | < 0.1% |
| 91228 | 1554 | 0.1% |
| 91222 | 11 | < 0.1% |
| 91182 | 453 | < 0.1% |
| 90073 | 22 | < 0.1% |
ancien_nom_commune
Categorical
| Distinct | 1772 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 2847076 |
| Missing (%) | 96.9% |
| Memory size | 22.4 MiB |
| Annecy | 6204 |
|---|---|
| Évry | 1554 |
| Les Sables-d'Olonne | 1483 |
| Saint-Ouen | 1362 |
| Saint-Germain-en-Laye | 1222 |
| Other values (1767) |
Length
| Max length | 44 |
|---|---|
| Median length | 11 |
| Mean length | 12.41290232 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Cras-sur-Reyssouze |
|---|---|
| 2nd row | Cras-sur-Reyssouze |
| 3rd row | Cras-sur-Reyssouze |
| 4th row | Étrez |
| 5th row | Bâgé-la-Ville |
Common Values
| Value | Count | Frequency (%) |
| Annecy | 6204 | 0.2% |
| Évry | 1554 | 0.1% |
| Les Sables-d'Olonne | 1483 | 0.1% |
| Saint-Ouen | 1362 | < 0.1% |
| Saint-Germain-en-Laye | 1222 | < 0.1% |
| Herblay | 1023 | < 0.1% |
| Les Belleville | 984 | < 0.1% |
| Olonne-sur-Mer | 934 | < 0.1% |
| Château-d'Olonne | 795 | < 0.1% |
| Thorens-Glières | 668 | < 0.1% |
| Other values (1762) | 76173 | 2.6% |
| (Missing) | 2847076 | 96.9% |
Length
| Value | Count | Frequency (%) |
| annecy | 6204 | 5.9% |
| les | 3989 | 3.8% |
| la | 3057 | 2.9% |
| le | 2864 | 2.7% |
| évry | 1554 | 1.5% |
| sables-d'olonne | 1483 | 1.4% |
| saint-ouen | 1362 | 1.3% |
| belleville | 1261 | 1.2% |
| saint-germain-en-laye | 1222 | 1.2% |
| herblay | 1023 | 1.0% |
| Other values (1776) | 81937 |
id_parcelle
Categorical
| Distinct | 1820762 |
|---|---|
| Distinct (%) | 61.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| 92073000AH0230 | 972 |
|---|---|
| 13106000BC0020 | 930 |
| 93010000AM0299 | 757 |
| 95277000ZS1580 | 728 |
| 92073000AR0385 | 684 |
| Other values (1820757) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 1456963 |
|---|---|
| Unique (%) | 49.6% |
Sample
| 1st row | 013700000A0253 |
|---|---|
| 2nd row | 014300000C1043 |
| 3rd row | 014300000C1157 |
| 4th row | 014300000C1159 |
| 5th row | 014300000C1160 |
Common Values
| Value | Count | Frequency (%) |
| 92073000AH0230 | 972 | < 0.1% |
| 13106000BC0020 | 930 | < 0.1% |
| 93010000AM0299 | 757 | < 0.1% |
| 95277000ZS1580 | 728 | < 0.1% |
| 92073000AR0385 | 684 | < 0.1% |
| 94041000AM0078 | 656 | < 0.1% |
| 940160000N0096 | 650 | < 0.1% |
| 77243000AS0132 | 646 | < 0.1% |
| 30189000EM0022 | 637 | < 0.1% |
| 84007000DP0120 | 608 | < 0.1% |
| Other values (1820752) | 2932210 |
Length
| Value | Count | Frequency (%) |
| 92073000ah0230 | 972 | < 0.1% |
| 13106000bc0020 | 930 | < 0.1% |
| 93010000am0299 | 757 | < 0.1% |
| 95277000zs1580 | 728 | < 0.1% |
| 92073000ar0385 | 684 | < 0.1% |
| 94041000am0078 | 656 | < 0.1% |
| 940160000n0096 | 650 | < 0.1% |
| 77243000as0132 | 646 | < 0.1% |
| 30189000em0022 | 637 | < 0.1% |
| 84007000dp0120 | 608 | < 0.1% |
| Other values (1820752) | 2932210 |
ancien_id_parcelle
Categorical
| Distinct | 25690 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 2907846 |
| Missing (%) | 98.9% |
| Memory size | 22.4 MiB |
| 85166000AC1258 | 82 |
|---|---|
| 91182000AB0141 | 61 |
| 78524000AB0102 | 54 |
| 85166000AW0368 | 43 |
| 85060000AI0349 | 40 |
| Other values (25685) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 22020 |
|---|---|
| Unique (%) | 69.6% |
Sample
| 1st row | 01154000ZA0002 |
|---|---|
| 2nd row | 011440000B0330 |
| 3rd row | 011440000B0332 |
| 4th row | 011440000B0625 |
| 5th row | 011440000B0635 |
Common Values
| Value | Count | Frequency (%) |
| 85166000AC1258 | 82 | < 0.1% |
| 91182000AB0141 | 61 | < 0.1% |
| 78524000AB0102 | 54 | < 0.1% |
| 85166000AW0368 | 43 | < 0.1% |
| 85060000AI0349 | 40 | < 0.1% |
| 91182000AN0528 | 34 | < 0.1% |
| 78524000AB0007 | 34 | < 0.1% |
| 782510000B0256 | 28 | < 0.1% |
| 78524000AB0152 | 27 | < 0.1% |
| 01091458ZB0600 | 26 | < 0.1% |
| Other values (25680) | 31203 | 1.1% |
| (Missing) | 2907846 | 98.9% |
Length
| Value | Count | Frequency (%) |
| 85166000ac1258 | 82 | 0.3% |
| 91182000ab0141 | 61 | 0.2% |
| 78524000ab0102 | 54 | 0.2% |
| 85166000aw0368 | 43 | 0.1% |
| 85060000ai0349 | 40 | 0.1% |
| 91182000an0528 | 34 | 0.1% |
| 78524000ab0007 | 34 | 0.1% |
| 782510000b0256 | 28 | 0.1% |
| 78524000ab0152 | 27 | 0.1% |
| 01091458zb0600 | 26 | 0.1% |
| Other values (25680) | 31203 |
lot1_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING, SKEWED
| Distinct | 17326 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 2700907 |
| Missing (%) | 91.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.50236923 |
| Minimum | 0.01 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 16.9 |
| Q1 | 33.85 |
| median | 53.5 |
| Q3 | 73.67 |
| 95-th percentile | 117.44 |
| Maximum | 9999 |
| Range | 9998.99 |
| Interquartile range (IQR) | 39.82 |
Descriptive statistics
| Standard deviation | 160.7394337 |
|---|---|
| Coefficient of variation (CV) | 2.453948393 |
| Kurtosis | 946.3559606 |
| Mean | 65.50236923 |
| Median Absolute Deviation (MAD) | 19.86 |
| Skewness | 27.63008812 |
| Sum | 15626965.73 |
| Variance | 25837.16555 |
| Monotonicity | Not monotonic |
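Several rows in the tables above are derived from a handful of basic figures. The sketch below is an illustration, not part of the generated report: it copies the values printed for lot1_surface_carrez, takes the number of observations from the dataset overview, and recomputes the derived rows. It assumes that Distinct (%) and Mean are taken over non-missing values only, which is what the arithmetic suggests; the variable names are hypothetical.

```python
# Figures copied from the lot1_surface_carrez tables; names are illustrative.
n_rows = 2939478        # "Number of observations" in the dataset overview
missing = 2700907       # "Missing"
distinct = 17326        # "Distinct"
total = 15626965.73     # "Sum"
mean = 65.50236923      # "Mean"
std = 160.7394337       # "Standard deviation"
q1, q3 = 33.85, 73.67   # "Q1" and "Q3"
minimum, maximum = 0.01, 9999

# Assumption: percentages and the mean use non-missing rows as denominator.
n_present = n_rows - missing

checks = {
    "distinct_pct": 100 * distinct / n_present,  # Distinct (%)
    "mean": total / n_present,                   # Mean = Sum / non-missing count
    "cv": std / mean,                            # Coefficient of variation
    "iqr": q3 - q1,                              # Interquartile range
    "range": maximum - minimum,                  # Range
    "variance": std ** 2,                        # Variance
}

for name, value in checks.items():
    print(f"{name}: {value:.4f}")
```

The same relations hold for the other numeric variables in this report, so a value that fails one of these checks is more likely an extraction error than a property of the data.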
| Value | Count | Frequency (%) |
| 12 | 328 | < 0.1% |
| 12.5 | 317 | < 0.1% |
| 15 | 300 | < 0.1% |
| 67 | 289 | < 0.1% |
| 60 | 282 | < 0.1% |
| 30 | 279 | < 0.1% |
| 65 | 275 | < 0.1% |
| 45 | 270 | < 0.1% |
| 40 | 267 | < 0.1% |
| 70 | 257 | < 0.1% |
| Other values (17316) | 235707 | 8.0% |
| (Missing) | 2700907 | 91.9% |
| Value | Count | Frequency (%) |
| 0.01 | 1 | < 0.1% |
| 0.3 | 3 | < 0.1% |
| 0.36 | 7 | < 0.1% |
| 0.7 | 2 | < 0.1% |
| 0.73 | 2 | < 0.1% |
| 0.77 | 1 | < 0.1% |
| 0.8 | 3 | < 0.1% |
| 0.9 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1 | 27 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 2 | < 0.1% |
| 9164 | 1 | < 0.1% |
| 8432 | 1 | < 0.1% |
| 8154 | 1 | < 0.1% |
| 7933 | 1 | < 0.1% |
| 7587 | 1 | < 0.1% |
| 7549 | 1 | < 0.1% |
| 7500 | 1 | < 0.1% |
| 7256 | 1 | < 0.1% |
| 6969 | 1 | < 0.1% |
lot2_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING, SKEWED
| Distinct | 11781 |
|---|---|
| Distinct (%) | 19.6% |
| Missing | 2879425 |
| Missing (%) | 98.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.99463807 |
| Minimum | 0.01 |
|---|---|
| Maximum | 6792 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 22.85 |
| Q1 | 43.03 |
| median | 61.07 |
| Q3 | 76.31 |
| 95-th percentile | 110.63 |
| Maximum | 6792 |
| Range | 6791.99 |
| Interquartile range (IQR) | 33.28 |
Descriptive statistics
| Standard deviation | 107.9081452 |
|---|---|
| Coefficient of variation (CV) | 1.635104736 |
| Kurtosis | 1434.387619 |
| Mean | 65.99463807 |
| Median Absolute Deviation (MAD) | 16.73 |
| Skewness | 33.39231458 |
| Sum | 3963176 |
| Variance | 11644.16781 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 80 | < 0.1% |
| 40 | 71 | < 0.1% |
| 62 | 70 | < 0.1% |
| 67 | 67 | < 0.1% |
| 63 | 66 | < 0.1% |
| 70 | 65 | < 0.1% |
| 64 | 64 | < 0.1% |
| 72 | 59 | < 0.1% |
| 55 | 56 | < 0.1% |
| 57 | 55 | < 0.1% |
| Other values (11771) | 59400 | 2.0% |
| (Missing) | 2879425 | 98.0% |
| Value | Count | Frequency (%) |
| 0.01 | 1 | < 0.1% |
| 0.36 | 1 | < 0.1% |
| 0.51 | 1 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| 0.7 | 2 | < 0.1% |
| 0.73 | 2 | < 0.1% |
| 0.75 | 2 | < 0.1% |
| 0.83 | 1 | < 0.1% |
| 0.9 | 1 | < 0.1% |
| 0.93 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6792 | 1 | < 0.1% |
| 6571 | 1 | < 0.1% |
| 6554 | 1 | < 0.1% |
| 5738 | 1 | < 0.1% |
| 5532 | 1 | < 0.1% |
| 4980 | 1 | < 0.1% |
| 4491 | 1 | < 0.1% |
| 3888 | 1 | < 0.1% |
| 3803 | 1 | < 0.1% |
| 3606 | 1 | < 0.1% |
lot3_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING, SKEWED
| Distinct | 4365 |
|---|---|
| Distinct (%) | 74.1% |
| Missing | 2933591 |
| Missing (%) | 99.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.47500425 |
| Minimum | 0.27 |
|---|---|
| Maximum | 7783.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0.27 |
|---|---|
| 5-th percentile | 12.5 |
| Q1 | 38.2 |
| median | 61.56 |
| Q3 | 86.29 |
| 95-th percentile | 157.684 |
| Maximum | 7783.9 |
| Range | 7783.63 |
| Interquartile range (IQR) | 48.09 |
Descriptive statistics
| Standard deviation | 133.2677603 |
|---|---|
| Coefficient of variation (CV) | 1.789429375 |
| Kurtosis | 2026.815321 |
| Mean | 74.47500425 |
| Median Absolute Deviation (MAD) | 23.93 |
| Skewness | 38.11228458 |
| Sum | 438434.35 |
| Variance | 17760.29593 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.98 | 21 | < 0.1% |
| 12.5 | 20 | < 0.1% |
| 56.2 | 18 | < 0.1% |
| 80 | 13 | < 0.1% |
| 12 | 11 | < 0.1% |
| 35 | 10 | < 0.1% |
| 30 | 10 | < 0.1% |
| 40 | 9 | < 0.1% |
| 20 | 8 | < 0.1% |
| 22.92 | 8 | < 0.1% |
| Other values (4355) | 5759 | 0.2% |
| (Missing) | 2933591 | 99.8% |
| Value | Count | Frequency (%) |
| 0.27 | 1 | < 0.1% |
| 0.28 | 1 | < 0.1% |
| 0.36 | 2 | < 0.1% |
| 0.74 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.83 | 1 | < 0.1% |
| 0.85 | 3 | < 0.1% |
| 1 | 2 | < 0.1% |
| 1.1 | 1 | < 0.1% |
| 1.35 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7783.9 | 1 | < 0.1% |
| 3916.3 | 1 | < 0.1% |
| 1285 | 1 | < 0.1% |
| 1239.3 | 1 | < 0.1% |
| 1217.7 | 1 | < 0.1% |
| 1206.2 | 1 | < 0.1% |
| 1102.16 | 1 | < 0.1% |
| 1085 | 1 | < 0.1% |
| 1028 | 1 | < 0.1% |
| 884.98 | 1 | < 0.1% |
lot4_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 1258 |
|---|---|
| Distinct (%) | 86.2% |
| Missing | 2938019 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.64703221 |
| Minimum | 0.36 |
|---|---|
| Maximum | 1612 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0.36 |
|---|---|
| 5-th percentile | 8.98 |
| Q1 | 30.11 |
| median | 60.85 |
| Q3 | 95.005 |
| 95-th percentile | 213.747 |
| Maximum | 1612 |
| Range | 1611.64 |
| Interquartile range (IQR) | 64.895 |
Descriptive statistics
| Standard deviation | 111.5239584 |
|---|---|
| Coefficient of variation (CV) | 1.349400643 |
| Kurtosis | 55.02835672 |
| Mean | 82.64703221 |
| Median Absolute Deviation (MAD) | 31.85 |
| Skewness | 6.170102522 |
| Sum | 120582.02 |
| Variance | 12437.59329 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.8 | 21 | < 0.1% |
| 56.82 | 10 | < 0.1% |
| 18.16 | 7 | < 0.1% |
| 12.5 | 6 | < 0.1% |
| 21 | 6 | < 0.1% |
| 14.84 | 5 | < 0.1% |
| 22.7 | 5 | < 0.1% |
| 658.07 | 5 | < 0.1% |
| 15 | 4 | < 0.1% |
| 59.99 | 4 | < 0.1% |
| Other values (1248) | 1386 | < 0.1% |
| (Missing) | 2938019 | > 99.9% |
| Value | Count | Frequency (%) |
| 0.36 | 1 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 0.83 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.32 | 1 | < 0.1% |
| 1.36 | 1 | < 0.1% |
| 2 | 2 | < 0.1% |
| 2.17 | 1 | < 0.1% |
| 2.7 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1612 | 1 | < 0.1% |
| 1295.5 | 1 | < 0.1% |
| 1217.1 | 1 | < 0.1% |
| 884.98 | 2 | < 0.1% |
| 883.5 | 1 | < 0.1% |
| 881.28 | 2 | < 0.1% |
| 860 | 1 | < 0.1% |
| 727.87 | 1 | < 0.1% |
| 661.91 | 1 | < 0.1% |
| 658.07 | 5 | < 0.1% |
lot5_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 547 |
|---|---|
| Distinct (%) | 84.9% |
| Missing | 2938834 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.06465839 |
| Minimum | 1.35 |
|---|---|
| Maximum | 1328 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1.35 |
|---|---|
| 5-th percentile | 6.66 |
| Q1 | 21.8 |
| median | 56.53 |
| Q3 | 99.4225 |
| 95-th percentile | 265.303 |
| Maximum | 1328 |
| Range | 1326.65 |
| Interquartile range (IQR) | 77.6225 |
Descriptive statistics
| Standard deviation | 107.8456248 |
|---|---|
| Coefficient of variation (CV) | 1.282888992 |
| Kurtosis | 36.94631522 |
| Mean | 84.06465839 |
| Median Absolute Deviation (MAD) | 35.845 |
| Skewness | 4.736788302 |
| Sum | 54137.64 |
| Variance | 11630.67879 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.66 | 21 | < 0.1% |
| 34.47 | 7 | < 0.1% |
| 21.44 | 6 | < 0.1% |
| 22.7 | 5 | < 0.1% |
| 59.92 | 4 | < 0.1% |
| 8 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 60 | 3 | < 0.1% |
| 70 | 3 | < 0.1% |
| 12 | 3 | < 0.1% |
| Other values (537) | 585 | < 0.1% |
| (Missing) | 2938834 | > 99.9% |
| Value | Count | Frequency (%) |
| 1.35 | 1 | < 0.1% |
| 1.63 | 1 | < 0.1% |
| 2 | 3 | < 0.1% |
| 2.88 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.7 | 1 | < 0.1% |
| 3.8 | 1 | < 0.1% |
| 3.85 | 1 | < 0.1% |
| 3.92 | 1 | < 0.1% |
| 4.04 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1328 | 1 | < 0.1% |
| 860 | 1 | < 0.1% |
| 699 | 1 | < 0.1% |
| 658.07 | 2 | < 0.1% |
| 650.02 | 1 | < 0.1% |
| 548.67 | 1 | < 0.1% |
| 543.72 | 1 | < 0.1% |
| 475.6 | 1 | < 0.1% |
| 418.57 | 1 | < 0.1% |
| 401.74 | 1 | < 0.1% |
| Distinct | 90 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3904628645 |
| Minimum | 0 |
|---|---|
| Maximum | 468 |
| Zeros | 2050131 |
| Zeros (%) | 69.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 468 |
| Range | 468 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8944235613 |
|---|---|
| Coefficient of variation (CV) | 2.2906751 |
| Kurtosis | 30236.37693 |
| Mean | 0.3904628645 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 82.80203865 |
| Sum | 1147757 |
| Variance | 0.799993507 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2050131 | 69.7% |
| 1 | 697791 | 23.7% |
| 2 | 159643 | 5.4% |
| 3 | 21069 | 0.7% |
| 4 | 5672 | 0.2% |
| 5 | 2122 | 0.1% |
| 6 | 1068 | < 0.1% |
| 7 | 515 | < 0.1% |
| 8 | 365 | < 0.1% |
| 9 | 205 | < 0.1% |
| Other values (80) | 897 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2050131 | 69.7% |
| 1 | 697791 | 23.7% |
| 2 | 159643 | 5.4% |
| 3 | 21069 | 0.7% |
| 4 | 5672 | 0.2% |
| 5 | 2122 | 0.1% |
| 6 | 1068 | < 0.1% |
| 7 | 515 | < 0.1% |
| 8 | 365 | < 0.1% |
| 9 | 205 | < 0.1% |
| Value | Count | Frequency (%) |
| 468 | 1 | < 0.1% |
| 223 | 1 | < 0.1% |
| 184 | 1 | < 0.1% |
| 170 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 149 | 1 | < 0.1% |
| 147 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 120 | 2 | < 0.1% |
| 118 | 1 | < 0.1% |
code_type_local
Categorical
HIGH CORRELATION (×5), MISSING
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1350027 |
| Missing (%) | 45.9% |
| Memory size | 22.4 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 3.0 | |
| 4.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 4.0 |
| 3rd row | 4.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 571095 | 19.4% |
| 2.0 | 520526 | 17.7% |
| 3.0 | 394511 | 13.4% |
| 4.0 | 103319 | 3.5% |
| (Missing) | 1350027 | 45.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 571095 | 35.9% |
| 2.0 | 520526 | 32.7% |
| 3.0 | 394511 | 24.8% |
| 4.0 | 103319 | 6.5% |
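Judging from the printed figures, the two frequency tables for code_type_local use different denominators: Common Values reports 3.5% for the value 4.0, while the table just above reports 6.5% for the same count. The sketch below is an illustration under that assumption, with hypothetical variable names; it recomputes both shares from the counts shown in the report and the number of observations given in the dataset overview.

```python
# Figures copied from the report; variable names are illustrative only.
n_rows = 2939478      # "Number of observations" in the dataset overview
missing = 1350027     # missing code_type_local values
counts = {"1.0": 571095, "2.0": 520526, "3.0": 394511, "4.0": 103319}

n_present = sum(counts.values())
# The four categories plus the missing rows should cover every observation.
assert n_present + missing == n_rows


def pct(part, whole):
    """Return a percentage rounded to one decimal, as displayed in the report."""
    return round(100 * part / whole, 1)


# Assumption: Common Values divides by all rows, the second table by
# non-missing rows only.
share_of_all_rows = {code: pct(n, n_rows) for code, n in counts.items()}
share_of_present = {code: pct(n, n_present) for code, n in counts.items()}

for code in counts:
    print(code, share_of_all_rows[code], share_of_present[code])
```

When comparing frequencies across variables with very different missing rates, it helps to state which of the two denominators is meant.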
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1350027 |
| Missing (%) | 45.9% |
| Memory size | 22.4 MiB |
| Maison | |
|---|---|
| Appartement | |
| Dépendance | |
| Local industriel. commercial ou assimilé |
Length
| Max length | 40 |
|---|---|
| Median length | 10 |
| Mean length | 10.84036312 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Appartement |
|---|---|
| 2nd row | Local industriel. commercial ou assimilé |
| 3rd row | Local industriel. commercial ou assimilé |
| 4th row | Maison |
| 5th row | Maison |
Common Values
| Value | Count | Frequency (%) |
| Maison | 571095 | 19.4% |
| Appartement | 520526 | 17.7% |
| Dépendance | 394511 | 13.4% |
| Local industriel. commercial ou assimilé | 103319 | 3.5% |
| (Missing) | 1350027 | 45.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| maison | 571095 | 28.5% |
| appartement | 520526 | 26.0% |
| dépendance | 394511 | 19.7% |
| assimilé | 103319 | 5.2% |
| ou | 103319 | 5.2% |
| commercial | 103319 | 5.2% |
| industriel | 103319 | 5.2% |
| local | 103319 | 5.2% |
surface_reelle_bati
Real number (ℝ≥0)
HIGH CORRELATION (×3), MISSING, SKEWED
| Distinct | 4617 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1763264 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.3063006 |
| Minimum | 1 |
|---|---|
| Maximum | 271450 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 50 |
| median | 75 |
| Q3 | 103 |
| 95-th percentile | 187 |
| Maximum | 271450 |
| Range | 271449 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 723.634591 |
|---|---|
| Coefficient of variation (CV) | 6.221800431 |
| Kurtosis | 22869.31238 |
| Mean | 116.3063006 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 101.1116364 |
| Sum | 136801099 |
| Variance | 523647.0212 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 23116 | 0.8% |
| 60 | 22263 | 0.8% |
| 70 | 20895 | 0.7% |
| 90 | 20405 | 0.7% |
| 100 | 17568 | 0.6% |
| 50 | 17380 | 0.6% |
| 65 | 16321 | 0.6% |
| 40 | 15884 | 0.5% |
| 75 | 14777 | 0.5% |
| 45 | 13712 | 0.5% |
| Other values (4607) | 993893 | 33.8% |
| (Missing) | 1763264 | 60.0% |
| Value | Count | Frequency (%) |
| 1 | 287 | < 0.1% |
| 2 | 165 | < 0.1% |
| 3 | 155 | < 0.1% |
| 4 | 143 | < 0.1% |
| 5 | 177 | < 0.1% |
| 6 | 305 | < 0.1% |
| 7 | 268 | < 0.1% |
| 8 | 490 | < 0.1% |
| 9 | 974 | < 0.1% |
| 10 | 3053 | 0.1% |
| Value | Count | Frequency (%) |
| 271450 | 1 | < 0.1% |
| 143568 | 1 | < 0.1% |
| 127139 | 1 | < 0.1% |
| 116201 | 1 | < 0.1% |
| 114600 | 1 | < 0.1% |
| 100200 | 1 | < 0.1% |
| 84320 | 1 | < 0.1% |
| 82896 | 1 | < 0.1% |
| 82598 | 2 | < 0.1% |
| 79462 | 1 | < 0.1% |
nombre_pieces_principales
Real number (ℝ≥0)
HIGH CORRELATION (×3), MISSING, ZEROS
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1352568 |
| Missing (%) | 46.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.357246473 |
| Minimum | 0 |
|---|---|
| Maximum | 96 |
| Zeros | 504465 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 96 |
| Range | 96 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.077591811 |
|---|---|
| Coefficient of variation (CV) | 0.8813638436 |
| Kurtosis | 10.67712223 |
| Mean | 2.357246473 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.8073559444 |
| Sum | 3740738 |
| Variance | 4.316387735 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 504465 | 17.2% |
| 3 | 266057 | 9.1% |
| 4 | 264996 | 9.0% |
| 2 | 191610 | 6.5% |
| 5 | 153481 | 5.2% |
| 1 | 113354 | 3.9% |
| 6 | 58553 | 2.0% |
| 7 | 21213 | 0.7% |
| 8 | 7622 | 0.3% |
| 9 | 2867 | 0.1% |
| Other values (41) | 2692 | 0.1% |
| (Missing) | 1352568 | 46.0% |
| Value | Count | Frequency (%) |
| 0 | 504465 | 17.2% |
| 1 | 113354 | 3.9% |
| 2 | 191610 | 6.5% |
| 3 | 266057 | 9.1% |
| 4 | 264996 | 9.0% |
| 5 | 153481 | 5.2% |
| 6 | 58553 | 2.0% |
| 7 | 21213 | 0.7% |
| 8 | 7622 | 0.3% |
| 9 | 2867 | 0.1% |
| Value | Count | Frequency (%) |
| 96 | 2 | < 0.1% |
| 81 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 47 | 2 | < 0.1% |
| 45 | 1 | < 0.1% |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 901816 |
| Missing (%) | 30.7% |
| Memory size | 22.4 MiB |
| S | |
|---|---|
| T | |
| P | |
| AB | |
| J | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.209043993 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | L |
|---|---|
| 2nd row | L |
| 3rd row | L |
| 4th row | S |
| 5th row | L |
Common Values
| Value | Count | Frequency (%) |
| S | 939239 | 32.0% |
| T | 305255 | 10.4% |
| P | 160055 | 5.4% |
| AB | 128001 | 4.4% |
| J | 102981 | 3.5% |
| BT | 85411 | 2.9% |
| L | 85105 | 2.9% |
| AG | 73035 | 2.5% |
| VI | 36546 | 1.2% |
| BR | 30770 | 1.0% |
| Other values (17) | 91264 | 3.1% |
| (Missing) | 901816 | 30.7% |
Length
| Value | Count | Frequency (%) |
| s | 939239 | 46.1% |
| t | 305255 | 15.0% |
| p | 160055 | 7.9% |
| ab | 128001 | 6.3% |
| j | 102981 | 5.1% |
| bt | 85411 | 4.2% |
| l | 85105 | 4.2% |
| ag | 73035 | 3.6% |
| vi | 36546 | 1.8% |
| br | 30770 | 1.5% |
| Other values (17) | 91264 | 4.5% |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 901816 |
| Missing (%) | 30.7% |
| Memory size | 22.4 MiB |
| sols | |
|---|---|
| terres | |
| prés | |
| terrains a bâtir | |
| jardins | |
| Other values (22) |
Length
| Max length | 19 |
|---|---|
| Median length | 4 |
| Mean length | 6.777070486 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | landes |
|---|---|
| 2nd row | landes |
| 3rd row | landes |
| 4th row | sols |
| 5th row | landes |
Common Values
| Value | Count | Frequency (%) |
| sols | 939239 | 32.0% |
| terres | 305255 | 10.4% |
| prés | 160055 | 5.4% |
| terrains a bâtir | 128001 | 4.4% |
| jardins | 102981 | 3.5% |
| taillis simples | 85411 | 2.9% |
| landes | 85105 | 2.9% |
| terrains d'agrément | 73035 | 2.5% |
| vignes | 36546 | 1.2% |
| futaies résineuses | 30770 | 1.0% |
| Other values (17) | 91264 | 3.1% |
| (Missing) | 901816 | 30.7% |
Length
| Value | Count | Frequency (%) |
| sols | 939239 | |
| terres | 305359 | 12.1% |
| terrains | 201036 | 8.0% |
| prés | 162461 | 6.4% |
| a | 128001 | 5.1% |
| bâtir | 128001 | 5.1% |
| jardins | 102981 | 4.1% |
| taillis | 100034 | 4.0% |
| simples | 85411 | 3.4% |
| landes | 85387 | 3.4% |
| Other values (24) | 284942 | 11.3% |
code_nature_culture_speciale
Categorical
| Distinct | 124 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2800349 |
| Missing (%) | 95.3% |
| Memory size | 22.4 MiB |
| POTAG | |
|---|---|
| PATUR | |
| PARC | |
| PIN | |
| FRICH | |
| Other values (119) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.459386612 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | JARD |
|---|---|
| 2nd row | ACACI |
| 3rd row | ACACI |
| 4th row | POTAG |
| 5th row | FOSSE |
Common Values
| Value | Count | Frequency (%) |
| POTAG | 28150 | 1.0% |
| PATUR | 14516 | 0.5% |
| PARC | 10980 | 0.4% |
| PIN | 10183 | 0.3% |
| FRICH | 9492 | 0.3% |
| VAOC | 7536 | 0.3% |
| IMM | 7155 | 0.2% |
| CHAT | 3953 | 0.1% |
| CHENE | 3282 | 0.1% |
| RUE | 3041 | 0.1% |
| Other values (114) | 40841 | 1.4% |
| (Missing) | 2800349 | 95.3% |
Length
| Value | Count | Frequency (%) |
| potag | 28150 | 20.2% |
| patur | 14516 | 10.4% |
| parc | 10980 | 7.9% |
| pin | 10183 | 7.3% |
| frich | 9492 | 6.8% |
| vaoc | 7536 | 5.4% |
| imm | 7155 | 5.1% |
| chat | 3953 | 2.8% |
| chene | 3282 | 2.4% |
| rue | 3041 | 2.2% |
| Other values (114) | 40841 | 29.4% |
nature_culture_speciale
Categorical
| Distinct | 124 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2800349 |
| Missing (%) | 95.3% |
| Memory size | 22.4 MiB |
| Jardin potager | |
|---|---|
| Pâture plantée | |
| Parc | |
| Pins | |
| Friche | |
| Other values (119) |
Length
| Max length | 38 |
|---|---|
| Median length | 14 |
| Mean length | 13.32630149 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Jardin d'agrément |
|---|---|
| 2nd row | Acacias |
| 3rd row | Acacias |
| 4th row | Jardin potager |
| 5th row | Fosse |
Common Values
| Value | Count | Frequency (%) |
| Jardin potager | 28150 | 1.0% |
| Pâture plantée | 14516 | 0.5% |
| Parc | 10980 | 0.4% |
| Pins | 10183 | 0.3% |
| Friche | 9492 | 0.3% |
| Vins d'appellation d'origine contrôlée | 7536 | 0.3% |
| Dépendances d'ensemble immobilier | 7155 | 0.2% |
| Châtaigneraie | 3953 | 0.1% |
| Chênes | 3282 | 0.1% |
| Rue | 3041 | 0.1% |
| Other values (114) | 40841 | 1.4% |
| (Missing) | 2800349 | 95.3% |
Length
| Value | Count | Frequency (%) |
| jardin | 29525 | 11.3% |
| potager | 28150 | 10.8% |
| pâture | 14516 | 5.6% |
| plantée | 14516 | 5.6% |
| parc | 10984 | 4.2% |
| pins | 10183 | 3.9% |
| friche | 9492 | 3.6% |
| vins | 7704 | 3.0% |
| ou | 7657 | 2.9% |
| contrôlée | 7536 | 2.9% |
| Other values (159) | 120562 |
| Distinct | 44269 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 901874 |
| Missing (%) | 30.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3075.789123 |
| Minimum | 1 |
|---|---|
| Maximum | 4301668 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 235 |
| median | 630 |
| Q3 | 1980 |
| 95-th percentile | 12895 |
| Maximum | 4301668 |
| Range | 4301667 |
| Interquartile range (IQR) | 1745 |
Descriptive statistics
| Standard deviation | 15396.96176 |
|---|---|
| Coefficient of variation (CV) | 5.005857406 |
| Kurtosis | 28269.44841 |
| Mean | 3075.789123 |
| Median Absolute Deviation (MAD) | 509 |
| Skewness | 121.0253572 |
| Sum | 6267240220 |
| Variance | 237066431.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 38122 | 1.3% |
| 1000 | 18325 | 0.6% |
| 800 | 5961 | 0.2% |
| 600 | 5590 | 0.2% |
| 12 | 5010 | 0.2% |
| 400 | 4770 | 0.2% |
| 13 | 4674 | 0.2% |
| 100 | 4605 | 0.2% |
| 700 | 4602 | 0.2% |
| 300 | 4490 | 0.2% |
| Other values (44259) | 1941455 | 66.0% |
| (Missing) | 901874 | 30.7% |
| Value | Count | Frequency (%) |
| 1 | 4187 | 0.1% |
| 2 | 3408 | 0.1% |
| 3 | 3305 | 0.1% |
| 4 | 3435 | 0.1% |
| 5 | 3491 | 0.1% |
| 6 | 3401 | 0.1% |
| 7 | 3240 | 0.1% |
| 8 | 3355 | 0.1% |
| 9 | 3187 | 0.1% |
| 10 | 4321 | 0.1% |
| Value | Count | Frequency (%) |
| 4301668 | 6 | < 0.1% |
| 4188939 | 1 | < 0.1% |
| 3923036 | 1 | < 0.1% |
| 3796490 | 1 | < 0.1% |
| 3092894 | 1 | < 0.1% |
| 2633047 | 1 | < 0.1% |
| 2455315 | 1 | < 0.1% |
| 2445345 | 1 | < 0.1% |
| 2343155 | 3 | < 0.1% |
| 2302735 | 1 | < 0.1% |
| Distinct | 1594814 |
|---|---|
| Distinct (%) | 56.4% |
| Missing | 112870 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.284574988 |
| Minimum | -63.149476 |
|---|---|
| Maximum | 55.828143 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 626728 |
| Negative (%) | 21.3% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | -63.149476 |
|---|---|
| 5-th percentile | -2.1288005 |
| Q1 | 0.33686775 |
| median | 2.359149 |
| Q3 | 4.560187 |
| 95-th percentile | 6.59459925 |
| Maximum | 55.828143 |
| Range | 118.977619 |
| Interquartile range (IQR) | 4.22331925 |
Descriptive statistics
| Standard deviation | 6.230969229 |
|---|---|
| Coefficient of variation (CV) | 2.727408494 |
| Kurtosis | 69.51076523 |
| Mean | 2.284574988 |
| Median Absolute Deviation (MAD) | 2.094442 |
| Skewness | -1.860785691 |
| Sum | 6457597.938 |
| Variance | 38.82497753 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.353493 | 930 | < 0.1% |
| 2.487316 | 757 | < 0.1% |
| 2.431922 | 728 | < 0.1% |
| 2.392866 | 656 | < 0.1% |
| 2.331678 | 650 | < 0.1% |
| 4.33429 | 637 | < 0.1% |
| 4.827475 | 608 | < 0.1% |
| 2.372733 | 552 | < 0.1% |
| -0.519837 | 510 | < 0.1% |
| 2.12356 | 492 | < 0.1% |
| Other values (1594804) | 2820088 | 95.9% |
| (Missing) | 112870 | 3.8% |
| Value | Count | Frequency (%) |
| -63.149476 | 3 | < 0.1% |
| -63.148901 | 8 | < 0.1% |
| -63.148504 | 4 | < 0.1% |
| -63.148481 | 4 | < 0.1% |
| -63.140541 | 4 | < 0.1% |
| -63.13953 | 6 | < 0.1% |
| -63.134939 | 1 | < 0.1% |
| -63.134652 | 1 | < 0.1% |
| -63.133647 | 2 | < 0.1% |
| -63.133337 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 55.828143 | 1 | < 0.1% |
| 55.827816 | 1 | < 0.1% |
| 55.826864 | 1 | < 0.1% |
| 55.826679 | 1 | < 0.1% |
| 55.826248 | 1 | < 0.1% |
| 55.824185 | 1 | < 0.1% |
| 55.823709 | 1 | < 0.1% |
| 55.823607 | 1 | < 0.1% |
| 55.823403 | 1 | < 0.1% |
| 55.822913 | 1 | < 0.1% |
| Distinct | 1549543 |
|---|---|
| Distinct (%) | 54.8% |
| Missing | 112870 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.16768913 |
| Minimum | -21.386074 |
|---|---|
| Maximum | 51.082118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 13200 |
| Negative (%) | 0.4% |
| Memory size | 22.4 MiB |
Quantile statistics
| Minimum | -21.386074 |
|---|---|
| 5-th percentile | 43.22609175 |
| Q1 | 44.7402405 |
| median | 46.7351275 |
| Q3 | 48.700214 |
| 95-th percentile | 49.8918253 |
| Maximum | 51.082118 |
| Range | 72.468192 |
| Interquartile range (IQR) | 3.9599735 |
Descriptive statistics
| Standard deviation | 5.605373036 |
|---|---|
| Coefficient of variation (CV) | 0.1214133335 |
| Kurtosis | 100.0702272 |
| Mean | 46.16768913 |
| Median Absolute Deviation (MAD) | 1.9721055 |
| Skewness | -9.041697244 |
| Sum | 130497959.4 |
| Variance | 31.42020688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43.388626 | 930 | < 0.1% |
| 48.898406 | 757 | < 0.1% |
| 48.986808 | 731 | < 0.1% |
| 48.811396 | 665 | < 0.1% |
| 48.786767 | 651 | < 0.1% |
| 43.823295 | 637 | < 0.1% |
| 43.953439 | 608 | < 0.1% |
| 48.974006 | 550 | < 0.1% |
| 47.487243 | 510 | < 0.1% |
| 48.864581 | 493 | < 0.1% |
| Other values (1549533) | 2820076 | 95.9% |
| (Missing) | 112870 | 3.8% |
| Value | Count | Frequency (%) |
| -21.386074 | 1 | < 0.1% |
| -21.385627 | 2 | < 0.1% |
| -21.384815 | 4 | < 0.1% |
| -21.384723 | 1 | < 0.1% |
| -21.384661 | 2 | < 0.1% |
| -21.384582 | 1 | < 0.1% |
| -21.384489 | 1 | < 0.1% |
| -21.384447 | 1 | < 0.1% |
| -21.383468 | 1 | < 0.1% |
| -21.383255 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 51.082118 | 3 | < 0.1% |
| 51.081947 | 6 | < 0.1% |
| 51.081805 | 6 | < 0.1% |
| 51.081765 | 5 | < 0.1% |
| 51.08171 | 2 | < 0.1% |
| 51.081631 | 10 | < 0.1% |
| 51.081576 | 6 | < 0.1% |
| 51.081102 | 1 | < 0.1% |
| 51.080875 | 1 | < 0.1% |
| 51.080809 | 1 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
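The calculation described above can be sketched in plain Python. This is only an illustration, not the code the report generator runs: the helper names `average_ranks`, `pearson_r` and `spearman_rho` are made up for this example, and tied values are assumed to receive the average of the ranks they span.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to the end of the run of values equal to values[order[i]].
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    # The 1/n factors of covariance and standard deviations cancel out.
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship (y = x ** 3).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)   # very close to 1
r = pearson_r(x, y)        # noticeably below 1
```

Because only the ordering of the values matters, ρ stays at its maximum for the cubic relationship, whereas r on the raw values drops below 1.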
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r (only the sign of the slope does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
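As a minimal sketch of this definition, the snippet below computes r with the standard library only and checks the invariance under location and scale changes. The function name `pearson_r` and the sample numbers are illustrative assumptions, not part of the report.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    # The 1/n factors of covariance and standard deviations cancel out.
    return cov / (spread_x * spread_y)


# Made-up sample data with a roughly linear relationship.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r_original = pearson_r(x, y)

# Shift and rescale both variables independently (positive scale factors):
# r should be unchanged up to floating-point error.
x_scaled = [10 * v + 3 for v in x]
y_scaled = [0.5 * v - 7 for v in y]
r_scaled = pearson_r(x_scaled, y_scaled)

# Flipping the sign of one variable flips the sign of r.
r_flipped = pearson_r(x, [-v for v in y])
```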
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
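The pair-counting formula above can be sketched directly. This follows the description literally (often called τ-a) and counts tied pairs as neither concordant nor discordant; statistical libraries commonly report a tie-adjusted variant instead, so results may differ when the data contain ties. The function name `kendall_tau_a` is an assumption made for this example.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(x)), 2):
        # The sign of the product tells whether x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it counts toward neither total.
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Four observations: 6 pairs, of which 5 are concordant and 1 is discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])  # (5 - 1) / 6
```

The quadratic loop over all pairs is fine for illustration but would be slow on a dataset of this size.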